Cyanogen Bromide (CNBr) Cleavage 
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1 


1 

ERVLQGLLKP 


LFKSTSVGPL 


YSG 


2 


ERVLQGLLKP 


LFKSTSVGPL 


YSG 


3 


ERVLQGLLGP 


MFKNTSVGLL 


YSG 


4 


ERVLQGLLRP 


VFKNTSVGPL 


YSG 


5 


ERVLQGLLSP 


IFKNSSVGSL 


YSG 


6 


ERVLQGLLKP 


LFKSTSVGPL 


YSG 




50 
YHPNP 
TLRLDP 
YRPDP 
jTYRPDP 
HRPDP 
HRLDP 



1 
2 
3 
4 
5 
6 



51 

KRPGLDREQL 
TGPGLDRERL 
KSPGLDREQL 
KSPGLDREQL 
KSPGLDRERL 
LNPGLDREQL 



101 

1 PGTSTVYWAT 

2 PGTSAVHLET 

3 PGTPTVDLGT 

4 PGTPTVDLGT 

5 PDTSTMHLAT 

6 PDTSTMHLAT 



YCELSQLTHD 
YWELSQLTNS 
YWELSQLTHS 
YWELSQLTHS 
YWKLSQLTHG 
YWELSKLTRG 



TGTPSSFPGH 
SGTPASLPGH 
SGTPVSKPGP 
SGTPVSKPGP 
SRTPASLSGP 
SRTPASLSGP 



ITELGPYSLD 
VTELGPYTLD 
ITELGPYTLD 
ITELGPYTLD 
ITELGPYTLD 
IIELGPYLLD 



TEPGPLLIPF 
TAPGPLLVPF 
SAASPLLIPF 
SAASPLLVPF 
TTASPLLVLF 
TTASPLLIPF 



RDSLYVNGFT 
RDSLYVNGFT 
RDSLYVNGFT 
RDSLYVNGFT 
RHSLYVNGFT 
RGSLYVNGFS 



TFNFTITNLH 
TLNFTITNLQ 
TINFTITNLR 
TLNFTITNLQ 
TINFTITNQR 
TLNFTITNLQ 



100 

HQNSVPTTST 
HRSSVPTTSI 
QRSSVPTTSI 
QRSSVPTTSI 
HQSSMTTTRT 
RQSSMTTTRT 

150 

YEENMQHPGS 
YEEDMRHPGS 
YEENMGHPGS 
YEEDMHRPGS 
YEENMHHPGS 
YEENMGHPGS 



151 

1 RKFNTTERVL 

2 RKFNTTERVL 

3 RKFNIMERVL 

4 RKFNATERVL 

5 RKFNTTERVL 

6 RKFNIMERVL 

201 

1 THRVDPIGPG 

2 THRLDPLNPG 

3 THRLDPKSPG 

4 LYHPNPKRPG 

5 TYRPDPKSPG 

6 LYHPNPKRPG 

251 

1 VPTTSTPGTS 

2 VPITSTPGTS 

3 VSTTSTPGTS 

4 MTTTRTPDTS 

5 VPTTSIPGTS 

6 VAPTSTPGTS 



200 



QGLLKPLFKN 
QGLLKPLFKS 
QGLLKPLFKN 
QGLLSPIFKN 
QGLLRPVFKN 
QGLLNPIFKN 



LDRERLYWEL 
LDREQLYWEL 
LNREQLYWEL 
LDREQLYWEL 
LDREQLYWEL 
LDREQLYWEL 



TSVGPLYSGl 
TSVGPLYSGI 
TSVGPLYSGl 
SSVGPLYSG! 
TSVGPLYSG; 
SSVGPLYSG! 



SQLTNSIHEL 
SKLTRGIIEL 
SKLTNDIEEL 
SQLTHNITEL 
SQLTHSITEL 
SQLTHGIKEL 




GPYTLDRDSL 
GPYLLDRGSL 
GPYTLDRNSL 
GPYSLDRDSL 
GPYTQDRDSL 
GPYTLDRNSL 



250 

YVNGFNPRSS 
YVNGFTHRNF 
YVNGFTHQSS 
YVNGFTHQSS 
YVNGFTHRSS 
YVNGFTHRSS 



TVHLATSGTP SSL . . PGHTA 
TVHLGTSETP SSL . . PRPIV 
TVDLRTSGTP SSLSSPTIMA 
TMHLATSRTP ASLSGPT . . T 

AVHLETSGTP ASL. . P— 

TVDLGTSGTP SSL..PSPTT AVPLLIPF 



288 
PVPLLI — 
PGPLLIPF 
AGPLLIPF 
ASPLLIPF 



Figure 3 



Our Sequence (95-76) 



ERVLQGPLSP IFKNSSVGPL YSG^^^^^^^^^^j^LYHPNP 
KRPGLDREQL YWELSQLTHN ITELGPYSLD RDSLYVNGFT HQNSVPTTST 
PGTSTVYWAT TGTPSSFPGH TEPGPLLI PF TLNFTITNLQ YEENMGHPGS 
RKFNITERVL QGLLNPIFKN SSVGPLYSG^^^^^^^^^^^^^p 
LYHPNPKRPG LDREQLYCEL SQLTHNITEL GPYSLDRDSL YVNGFTHQNS 
VPTTSTPGTS TVYWATTGTP SSFPGHTEPG PLLIPFTLNF TITNLQYEED 
MRRTGSRKFN TMERVLQGLL KPLFKSTSVG PLYSG|g^S^^^^^^ 
ilS^H TLRL DPTGPGLDRE RLYWELSQLT NSVTELGPYT LDRDSLYVN 
GFTHRSSVPT TSIPGTSAVH LETSGTPASL PGHTAPGPLL VP FT LN FT IT 
NLQYEEDMRH PGSRKFNTTE RVLQGLLKPL FKSTSVGPLY SGjj^^M^ 
^^^^^g^THRLDPL NPGLDREQLY WELSKLTRGI IELGPYLLDR 
G S L YVNG FT H X "rNFV P I T S T P GTSTVHLGTS ETPSSLPRPI VPGPLLIPF 
TLNFTITNLQ YEENMGHPGS RKFNITERVL QGLLKPLFRN SSLEYLYSGf 
^g^^^^^^M^jgj THRPDPEDLG LDRERLYWEL SNLTNGIQEL 
GPYTLDRNSL YVNGFTHRSS MPTTSTPGTS TVDVGTSGTP SSSPSPTTAG 
PLLMPFTLNF TITNLQYEED MRRTGSRKFN TMESVLQGLL KPLFKNTSVG 
PLYSG^^^B^^^M^PBS THRL DPKSPGLNRE QLYWELSKLT 
NDIEEVGPYT LDRNSLYVNG FTHRSFVAPT STLGTSTVDL GTSGTPSSLP 
SPTTGVPLLI PFTLNFTITN LQYEENMGHP GSRKFNIMER VLQGLLSPIF 
KNSSVGSLYS 



■jAI^KialTHRPDPKS PGLDRERLYW 
KLSQLTHGII ELG P YTLDR HS FYVNG FTH QSSMTTTRTP DTSTMHLATS 
RTPASLSGPT TASPLLVLFT INFTITNQRY EENMHHPGSR KFNTTERVLQ 
GLLRPVFKNT SVGPLYSG^ Mi^^M^^^ Sg^T YRPDPKSPGL 
DREQLYWELS QLTHSITELG PYTQDRDSLY VNGFTHRSSV PTTSIPGTSA 
VHLETSGTPA SLP 



Figure 4a 



AK024365 



MPLFKNTSVS SLYSGl^S^ffiM^M*^ 11 ^ &PKSPGLDRE 
RLYWKLSQLT HGIIELGPYT LDRHSFYVNG FTHQSSMTTT RTPDTSTMHL 
ATSRTPASLS GPTTASPLLV LFTINFTITN QRYEENMHHP GSRKFNTTER 
VLQGLLRPVF KNTSVGPLYS G^S^B^^»TYRPDPKS 
PGLDREQLYW ELSQLTHSIT ELGPYTQDRD SLYVNGFTHR SSVPTTSIPG 
TSAVHLETSG TPASLPGPSA AS PLLVLFTL NFTITNLRYE ENMQHPGSRK 
FNTTERVLQG LLRSLFKSTS VGPLYSGg^^^^^^^^^^TH 
HPDPKSPRLD REQLYWELSQ LTHNITELGH YALDNDSLFV NGFTHRSSVS 
TTSTPGTPTV YLGASKTPAS IFGPSAASHL LILFTLNFTI TNLRYEENMW 
PGSRKFNTTE RVLQGLLRPL FKNTSVGPLY SGg^^^^^^J^^j 
ggTHRPDPT GPGLDREQLY LELSQLTHSI TELGPYTLDR DSL YVNG FTH 
RSSVPTTSTG VVSEEPFTLN FTINNLRYMA DWGQPGSLKF NITDNVMKHL 
LSPLFQRSSL GARYTG^^^^^^^^1|TYL QPLSGPGLPI 
KQVFHELSQQ THGITRLGPY SLDKDSLYLN GYNEPGLDEP PTTPKPATTF 
LPPLSEATTA MGYHLKTLTL NFTISNL QYS PDM GKGS ATF NSTEGVLQHL 
LRPLFQKSSM GPFYLGp^^M^^^Wl.TYH PDPVGPGLDI 
QQLYWELSQL THGVTQLGFY VLDRDSLFIN GYAPQNLSIR GEYQINFHIV 
NWNLSNPDPT SSEYITLLRD IQDKVTTLYK GSQLHDT FRF CLVTNLTMDS 
VLVTVKALFS SNLDPSLVEQ VFLDKTLNAS FHWLGSTYQL VDIHVTEMES 
SVYQPTSSSS TQHFYPNFTI TNLPYSQDKA QPGTTNYQRN KRNIEDALNQ 
LFRNSSIKSY FSDCQVSTFR SVPNRHHTGV DSLCNFSPLA RRVDRVAIYE 
EFLRMTRNGT QLQNFTLDRS SVLVDGYSPN RNEPLTGNSD LPFWAVIFIG 
LAGLLGLITC LICGVLVTTR RRKKEGEYNV QQQCPGYYQS HLDLEDLQ 



Figure 4b 



ERVLQGPLSP 
51 

KRPGLDREQL 
101 

PGTSTVYWAT 
151 

RKFNITERVL 
201 

LYHPNPKRPG 
251 

VPTTSTPGTS 
301 

MRRTGSRKFN 

MlTLRL 
401 

FTHRSSVPTT 
451 

LQYEEDMRHP 
501 

551 
SLYVNGFTHR 
601 

NFTITNLQYE 

7oT ~ 
YTLDRNSLYV 
751 

LMPFTLNFTI 
851 

YSG^iwii 

951 

IEEVGPYTLD 
1001 

TTGVPLLIPF 
1051 

SSVGSLYS( 
1101 

SQLTHGIIEL 
1151 

ASLSGPTTAS 
1201 

RPVFKNTSVG 
1251 

QLYWELSQLT 
1301 

ETSGTPASLP 
1351 

VLQGLLRSLF 
1401 

PRLDREQLYW 
1451 

TPTVYLGASK 
1501 



IFKNSSVGPL 
YWELSQLTHN 
TGTPSSFPGH 
QGLLNPIFKN 
LDREQLYCEL 
TVYWATTGTP 
TMERVLQGLL 
DPTGPGLDRE 
SIPGTSAVHL 
GSRKFNTTER 
HRLDPLN 
NFVPITSTPG 
ENMGHPGSRK 

NGFTHRSSMP 
TNLQYEEDMR 



RNSLYVNGFT 
TLNFTITNLQ 

GPYTLDRHSF 
PLLVLFTINF 
PLYSGJ 
HSITELGPYT 
GPSAASPLLV 
KSTSVGPLYS 
ELSQLTHNIT 
TPASIFGPSA 



YSG] 

ITELGPYSLD RDSLYVNGFT 
TEPGPLLIPF TLNFTITNLQ 
SSVGPLYSG§|§| 

sqlthnitel gpysldrdsl 
ssfpghtepg pllipftlnf 
kplfkstsvg plys 
rlywelsqlt nsvtelgpyt 
etsgtpaslp ghtapgpllv 
vlqgllkplf kstsvgplys 
pgldreqlyw elskltrgii 
tstvhlgtse tpsslprpiv 
fnitervlqg llkplfrnss 
rpdpedlgld rerlywelsn 
ttstpgtstv dvgtsgtpss 
rtgsrkfntm esvlqgllkp 
^Pthrldp kspglnreql 
hrsfvaptst lgtstvdlgt 
yeenmghpgs rkfnimervl 

THRPDPKSPG 
YVNGFTHQSS MTTTRTPDTS 
TITNQRYEEN MHHPGSRKFN 
|§TYRP 
QDRDSLYVNG FTHRSSVPTT 
LFTLNFTITN LRYEENMQHP 

ELGHYALDND SLFVNGFTHR 
ASHLLILFTL NFTITNLRYE 



50 

SLYHPNP 
100 

HQNSVPTTST 
150 

YEENMGHPGS 
200 



250 

YVNGFTHQNS 
300 

TITNLQYEED 
350 

400 

LDRDSLYVNG 
450 

PFTLNFTITN 
500 

550 

ELGPYLLDRG 
600 

PGPLLIPFTL 
650 

LEYLYSG§§|| 
700 

LTNGIQELGP 
750 

SPSPTTAGPL 
800 

LFKNTSVGPL 
900 

YWELSKLTND 
1000 
SGTPSSLPSP 
1050 
QGLLSPIFKN 
1100 
LDRERLYWKL 
1150 
TMHLATSRTP 
1200 
TTERVLQGLL 
1250 
DPKSPGLDRE 
1300 
SIPGTSAVHL 
1350 
GSRKFNTTER 
1400 
'HHPDPKS 
1450 
SSVSTTSTPG 
1500 
ENMWPGSRKF 
1550 



Figure 5a 



NTTERVLQGL 
1551 

PDPTGPGLDR 
1601 

TSTGWSEEP 
1651 

RSSLGARYTG 
1701 

LSQQTHGITR 
1751 

ATTAMGYHLK 
1801 

KSSMGPFYLG 
1851 

LSQLTHGVTQ 
1901 

PDPTSSEYIT 
1951 

ALFSSNLDPS 
2001 

SSSSTQHFYP 
2051 

IKSYFSDCQV 
2101 

RNGTQLQNFT 
2151 

LITCLICGVL 



LRPLFKNTSV 
EQLYLELSQL 
FTLNFTINNL 

LGPYSLDKDS 
TLTLNFTISN 

LGFYVLDRDS 
LLRDIQDKVT 
LVEQVFLDKT 
NFTITNLPYS 
STFRSVPNRH 
LDRSSVLVDG 
VTTRRRKKEG 



GPLYSG^m 

THSITELGPY 

RYMADMGQPG 

LYLNGYNEPG 
LQYSPDMGKG 

LFINGYAPQN 
TLYKGSQLHD 
LNASFHWLGS 
QDKAQPGTTN 
HTGVDSLCNF 
YSPNRNEPLT 
EYNVQQQCPG 



TLDRDSLYVN 
SLKFNITDNV 

ITYLQPLSGP 
LDEPPTTPKP 
SATFNSTEGV 

TYHPDPVGP 

LSIRGEYQIN 

TFRFCLVTNL 

TYQLVDIHVT 

YQRNKRNIED 

SPLARRVDRV 

GNSDLPFWAV 
2 

YYQSHLDLED 



1600 
GFTHRSSVPT 
1650 
MKHLLSPLFQ 
1700 
GLPIKQVFHE 
1750 
ATTFLPPLSE 
1800 
LQHLLRPLFQ 
1850 
GLDIQQLYWE 
1900 
FHIVNWNLSN 
1950 
TMDSVLVTVK 
2000 
EMESSVYQPT 
2050 
ALNQLFRNSS 
2100 
AIYEEFLRMT 
2150 
IFIGLAGLLG 
192 
LQ 



Figure 5a (cont) 



1 GAGAGGGTTC TGCAGGGTCC GCTTAGTCCC ATATTCAAGA ACTCCAGTGT 

51 TGGCCCTCTG TACTCTGGCT GCAGACTGAC CTCTCTCAGG CCCGAGAAGG 

101 ATGGGGCAGC AACTGGAATG GATGCTGTCT GCCTCTACCA CCCTAATCCC 

151 AAAAGACCTG GGCTGGACAG AGAGCAGCTG TACTGGGAGC TAAGCCAGCT 

201 GACCCACAAC ATCACTGAGC TGGGCCCCTA CAGCCTGGAC AGGGACAGTC 

251 TCTATGTCAA TGGTTTCACC CATCAGAACT CTGTGCCCAC CACCAGTACT 

301 CCTGGGACCT CC AC AGTGT A CTGGGCAACC ACTGGGACTC CATCCTCCTT 

351 CCCCGGCCAC ACAGAGCCTG GCCCTCTCCT GATACCATTC ACGCTCAACT 

401 TCACCATCAC TAACCTACAG TATGAGGAGA ACATGGGTCA CCCTGGCTCC 

4 51 AGGAAGTTCA ACATCACGGA GAGGGTTCTG CAGGGTCTGC TTAATCCCAT 

501 TTTCAAGAAC TCCAGTGTTG GCCCTCTGTA CTCTGGCTGC AGACTGACCT 

551 CTCTCAGGCC CGAGAAGGAT GGGGCAGCAA CTGGAATGGA TGCTGTCTGC 

601 CTCTACCACC CTAATCCCAA AAGACCTGGG CTGGACAGAG AGCAGCTGTA 

651 CTGCGAGCTA AGCCAGCTGA CCCACAACAT CACTGAGCTG GGCCCCTACA 

701 GCTTGGACAG GGACAGTCTT TATGTCAATG GTTTCACCCA TCAGAACTCT 

751 GTGCCCACCA CCAGTACTCC TGGGACCTCC ACAGTGTACT GGGCAACCAC 

801 TGGGACTCCA TCCTCCTTCC CCGGCCACAC AGAGCCTGGC CCTCTCCTGA 

851 TACCATTCAC CCTCAACTTC ACCATCACCA ACCTGCAGTA CGAGGAGGAC 

901 ATGCGTCGCA CTGGCTCCAG GAAGTTCAAC ACCATGGAGA GGGTTCTGCA 

951 GGGTCTGCTC AAGCCCTTGT TCAAGAGCAC CAGCGTTGGC CCTCTGTACT 

1001 CTGGCTGCAG ACTGACCTTG CTCAGACCTG AGAAACATGG GGCAGCCACT 

1051 GGAGTGGACG CCATCTGCAC CCTCCGCCTT GATCCCACTG GTCCTGGACT 

1101 GGACAGAGAG CGGCTATACT GGGAGCTGAG CCAGCTGACC AACAGCGTTA 

1151 CAGAGCTGGG CCCCTACACC CTGGACAGGG ACAGTCTCTA TGTCAATGGC 

1201 TTCACCCATC GGAGCTCTGT GCCAACCACC AGTATTCCTG GGACCTCTGC 

1251 AGTGCACCTG GAAACCTCTG GGACTCCAGC CTCCCTCCCT GGCCACACAG 

1301 CCCCTGGCCC TCTCCTGGTG CCATTCACCC TCAACTTCAC TATCACCAAC 

1351 CTGCAGTATG AGGAGGACAT GCGTCACCCT GGTTCCAGGA AGTTCAACAC 

14 01 CACGGAGAGA GTCCTGCAGG GTCTGCTCAA GCCCTTGTTC AAGAGCACCA 



14 51 GTGTTGGCCC TCTGTACTCT GGCTGCAGAC TGACCTTGCT CAGGCCTGAA 

1501 AAACGTGGGG CAGCCACCGG CGTGGACACC ATCTGCACTC ACCGCCTTGA 

1551 CCCTCTAAAC CCTGGACTGG ACAGAGAGCA GCTATACTGG GAGCTGAGCA 

1601 AACTGACCCG TGGCATCATC GAGCTGGGCC CCTACCTCCT GGACAGAGGC 

1651 AGTCTCTATG TCAATGGTTT CACCCATCGG AACTTTGTGC CCATCACCAG 

1701 CACTCCTGGG ACCTCCACAG TACACCTAGG AACCTCTGAA ACTCCATCCT 

1751 CCCTACCTAG ACCCATAGTG CCTGGCCCTC TCCTGATACC ATTCACACTC 

1801 AACTTCACCA TCACTAACCT ACAGTATGAG GAGAACATGG GTCACCCTGG 

1851 CTCCAGGAAG TTCAACATCA CGGAGAGGGT TCTGCAGGGT CTGCTCAAAC 

1901 CCTTGTTCAG GAATAGCAGT CTGGAATACC TCTATTCAGG CTGCAGACTA 

1951 ACCTCACTCA GGCCAGAGAA GGATAGCTCA ACCATGGCAG TGGATGCCAT 

2001 CTGCACACAT CGCCCTGACC CTGAAGACCT CGGACTGGAC AGAGAGCGAC 

2051 TGTACTGGGA GCTGAGCAAT CTGACAAATG GCATCCAGGA GCTGGGCCCC 

2101 TACACCCTGG ACCGGAACAG TCTCTATGTC AATGGTTTCA CCCATCGAAG 

2151 CTCTATGCCC ACCACCAGCA CTCCTGGGAC CTCCACAGTG GATGTGGGAA 

2201 CCTCAGGGAC TCCATCCTCC AGCCCCAGCC CCACGACTGC TGGCCCTCTC 

2251 CTGATGCCGT TCACCCTCAA CTTCACCATC ACCAACCTGC AGTACGAGGA 

2301 GGACATGCGT CGCACTGGCT CCAGGAAGTT CAACACCATG GAGAGTGTCC 

2351 TGCAGGGTCT GCTCAAGCCC TTGTTCAAGA ACACCAGTGT TGGCCCTCTG 

2401 TACTCTGGCT GCAGATTGAC CTTGCTCAGG CCCAAGAAAG ATGGGGCAGC 

24 51 CACTGGAGTG GATGCCATCT GCACCCACCG CCTTGACCCC AAAAGCCCTG 

2501 GACTCAACAG GGAGCAGCTG TACTGGGAGT TAAGCAAACT GACCAATGAC 

2551 ATTGAAGAGG TGGGCCCCTA CACCTTGGAC AGGAACAGTC TCTATGTCAA 

2601 TGGTTTCACC CATCGGAGCT TTGTGGCCCC CACCAGCACT CTTGGGACCT 

2651 CCACAGTGGA CCTTGGGACC TCAGGGACTC CATCCTCCCT CCCCAGCCCC 

27 01 ACAACAGGTG TTCCTCTCCT GATACCATTC ACACTCAACT TCACCATCAC 

27 51 TAACCTACAG TATGAGGAGA ACATGGGTCA CCCTGGCTCC AGGAAGTTCA 

2801 ACATCATGGA GAGGGTTCTG CAGGGTCTGC TTATGCCCTT GTTCAAGAAC 



Figure 5b (cont) 



2851 


ACCAGTGTCA 


GCTCTCTGTA CTCTGGTTGC 


AGACTGACCT 


TGCTCAGGCC 


2901 


TGAGAAGGAT 


GGGGCAGCCA CCAGAGTGGT 


TGCTGTCTGC 


ACCCATCGTC 


2951 


CTGACCCCAA 


AAGCCCTGGA CTGGACAGAG 


AGCGGCTGTA 


CTGGAAGCTG 


3001 


AGCCAGCTGA 


CCCACGGCAT CACTGAGCTG 


GGCCCCTACA 


CCCTGGACAG 


3051 


GCACAGTCTC 


TATGTCAATG GTTTCACCCA TCAGAGCTCT 


ATGACGACCA 


3101 


CCAGAACTCC 


TGATACCTCC ACAATGCACC 


TGGCAACCTC 


GAGAACTCCA 


3151 


GCCTCCCTGT 


CTGGACCTAC GACCGCCAGC 


CCTCTCCTGA 


TACCATTCAC 


3201 


AATTAACTTC 


ACCATCACTA ACCTGCGGTA 


TGAGGAGAAC 


ATGCATCACC 


3251 


CTGGCTCTAG 


AAAGTTTAAC ACCACGGAGA 


GAGTCCTTCA 


GGGTCTGCTC 


3301 


AGGCCTGTGT 


TCAAGAACAC CAGTGTTGGC 


CCTCTGTACT 


CTGGCTGCAG 


3351 


ACTGACCTTG 


CTCAGGCCCA AGAAGGATGG 


GGCAGCCACC 


AAAGTGGATG 


3401 


CCATCTGCAC 


CTACCGCCCT GATCCCAAAA 


GCCCTGGACT 


GGACAGAGAG 


3451 


CAGCTATACT 


GGGAGCTGAG CCAGCTAACC 


CACAGCATCA 


CTGAGCTGGG 


3501 


CCCCTACACC 


CTGGACAGGG ACAGTCTCTA 


TGTCAATGGT 


TTCACACAGC 


3551 


GGAGCTCTGT 


GCCCACCACT AGCATTCCTG 


GGACCCCCAC 


AGTGGACCTG 


3601 


GGAACATCTG 


GGACTCCAGT TTCTAAACCT 


GGTCCCTCGG 


CTGCCAGCCC 


3651 


TCTCCTGGTG 


CTATTCACTC TCAACTTCAC 


CATCACCAAC 


CTGCGGTATG 


3701 


AGGAGAACAT 


GCAGCACCCT GGCTCCAGGA 


AGTTCAACAC 


CACGGAGAGG 


3751 


GTCCTTCAGG 


GCCTGCTCAG GTCCCTGTTC AAGAGCACCA GTGTTGGCCC 


3801 


TCTGTACTCT 


GGCTGCAGAC TGACTTTGCT 


CAGGCCTGAA 


AAGGATGGGA 


3851 


CAGCCACTGG 


AGTGGATGCC ATCTGCACCC ACCACCCTGA CCCCAAAAGC 


3901 


CCTAGGCTGG 


AC AG AG AG C A GCTGTATTGG 


GAGCTGAGCC 


AGCTGACCCA 


3951 


CAATATCACT 


GAGCTGGGCC ACTATGCCCT 


GGACAACGAC 


AGCCTCTTTG 


4001 


TCAATGGTTT 


CACTCATCGG AGCTCTGTGT 


CCACCACCAG 


CACTCCTGGG 


4051 


ACCCCCACAG 


TGTATCTGGG AGCATCTAAG ACTCCAGCCT 


CGATATTTGG 


4101 


CCCTTCAGCT 


GCCAGCCATC TCCTGATACT 


ATTCACCCTC 


AACTTCACCA 


4151 


TCACTAACCT 


GCGGTATGAG GAGAACATGT 


GGCCTGGCTC 


CAGGAAGTTC 


4201 


AACACTACAG 


AGAGGGTCCT TCAGGGCCTG 


CTAAGGCCCT 


TGTTCAAGAA 


4251 


CACCAGTGTT 


GGCCCTCTGT ACTCTGGCTC 


: CAGGCTGACC 


TTGCTCAGGC 
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4301 


CAGAGAAAGA 


TGGGGAAGCC 


ACCGGAGTGG 


ATGCCATCTG 


CACCCACCGC 


4351 


CCTGACCCCA 


CAGGCCCTGG 


GCTGGAC AG A 


GAGCAGCTGT 


ATTTGGAGCT 


4401 


GAGCCAGCTG ACCCACAGCA 


TCACTGAGCT 


GGGCCCCTAC 


ACACTGGACA 


4451 


GGGACAGTCT 


CTATGTCAAT 


GGTTTCACCC 


ATCGGAGCTC 


TGTACCCACC 


A. SOI 


ACCAGCACCG 


GGGTGGTCAG 


CGAGGAGCCA 


TTCACACTGA 


ACT TC AC CAT 


a s c i 
y j j i 


CAACAACCTG 


CGCTACATGG 


CGGACATGGG 


CCAACCCGGC 


TCCCTCAAGT 


4 £n 1 

fi DU X 


TCAACATCAC 


AGACAACGTC 


ATGAAGCACC 


TGCTCAGTCC 


TTTGTTCCAG 


ft DO 1 


AGGAGCAGCC 


TGGGTGCACG 


GTACACAGGC 


TGCAGGGTCA 


TCGCACTAAG 


fi / U X 


GTCTGTGAAG 


AACGGTGCTG 


AGACACGGGT 


GGACCTCCTC 


TGCACCTACC 


/TCI 


TGCAGCCCCT 


CAGCGGCCCA 


GGTCTGCCTA 


TCAAGCAGGT 


GTTCCATGAG 


4 O U 1 


CTGAGCCAGC 


AGACCCATGG 


CATCACCCGG 


CTGGGCCCCT 


ACTCTCTGGA 


ft O D X 


CAAAGACAGC 


CTCTACCTTA ACGGTTACAA TGAACCTGGT 


CTAGATGAGC 


/i q n i 
ft yu x 


CTCCTACAAC 


TCCCAAGCCA 


GCCACCACAT 


TCCTGCCTCC 


TCTGTCAGAA 


ft .J X 


GCCACAACAG 


CCATGGGGTA 


CCACCTGAAG 


ACCCTCACAC 


TCAACTTCAC 


DU U X 


CATCTCCAAT 


CTCCAGTATT 


CACCAGATAT 


GGGCAAGGGC 


TCAGCTACAT 


c n t 


TCAACTCCAC 


CGAGGGGGTC 


CTTCAGCACC 


TGCTCAGACC 


CTTGTTCCAG 


OlUI 


AAGAGCAGCA 


TGGGCCCCTT 


CTACTTGGGT 


TGCCAACTGA 


TCTCCCTCAG 


CI CI 

jl Jl 


GCCTGAGAAG 


GATGGGGCAG 


CCACTGGTGT 


GGACACCACC 


TGCACCTACC 


jZU X 


ACCCTGACCC 


TGTGGGCCCC 


GGGCTGGACA 


TACAGCAGCT 


TTACTGGGAG 


c o cc 1 


CTGAGTCAGC 


TGACCCATGG 


TGTCACCCAA 


CTGGGCTTCT 


ATGTCCTGGA 


DjUI 


CAGGGATAGC CTCTTCATCA ATGGCTATGC 


ACCCCAGAAT 


TTATCAATCC 


53bl 


GGGGCGAGTA 


CCAGATAAAT 


TTCCACATTG 


TCAACTGGAA 


CCTCAGTAAT 


5401 


CCAGACCCCA 


CATCCTCAGA 


GTACATCACC 


CTGCTGAGGG 


ACATCCAGGA 


5451 


CAAGGTCACC 


ACACTCTACA AAGGCAGTCA ACT AC AT G AC ACATTCCGCT 


5501 


TCTGCCTGGT 


CACCAACTTG ACGATGGACT 


CCGTGTTGGT 


CACTGTCAAG 


5551 


GCATTGTTCT 


CCTCCAATTT 


GGACCCCAGC 


CTGGTGGAGC 


AAGTCTTTCT 


5601 


AGATAAGACC 


CTGAATGCCT 


CATTCCATTG 


GCTGGGCTCC 


ACCTACCAGT 


5651 


TGGTGGACAT 


CCATGTGACA GAAATGGAGT 


CATCAGTTTA 


. TCAACCAACA 
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5701 AGCAGCTCCA GCACCCAGCA CTTCTACCCG AATTTCACCA TCACCAACCT 

5751 ACCATATTCC CAGGACAAAG CCCAGCCAGG CACCACCAAT TACCAGAGGA 

5801 ACAAAAGGAA TATTGAGGAT GCGCTCAACC AACTCTTCCG AAACAGCAGC 

5851 ATCAAGAGTT ATTTTTCTGA CTGTCAAGTT TCAACATTCA GGTCTGTCCC 

5901 CAACAGGCAC CACACCGGGG TGGACTCCCT GTGTAACTTC TCGCCACTGG 

5951 CTCGGAGAGT AGACAGAGTT GCCATCTATG AGGAATTTCT GCGGATGACC 

6001 CGGAATGGTA CCCAGCTGCA GAACTTCACC CTGGACAGGA GCAGTGTCCT 

6051 TGTGGATGGG TATTCTCCCA ACAGAAATGA GCCCTTAACT GGGAATTCTG 

6101 ACCTTCCCTT CTGGGCTGTC ATCTTCATCG GCTTGGCAGG ACTCCTGGGA 

6151 CTCATCACAT GCCTGATCTG CGGTGTCCTG GTGACCACCC GCCGGCGGAA 

6201 GAAGGAAGGA GAATACAACG TCCAGCAACA GTGCCCAGGC TACTACCAGT 

6251 CACACCTAGA CCTGGAGGAT CTGCAATGAC TGGAACTTGC CGGTGCCTGG 

6301 GGTGCCTTTC CCCCAGCCAG GGTCCAAAGA AGCTTGGCTG GGGCAGAAAT 

6351 AAACCATATT GGTCG 



Figure 5b (cont) 




Figure 6 
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